A Novel Two-Phase SOM Clustering Approach to Discover Visitor Interests in a Website
نویسندگان
چکیده
Mining content, structure and usage data in websites can uncover browsing patterns that different groups of Web visitors follow to access the subjects that are truly valuable to them. Many works in the literature focused on proposing new similarity measures to cluster Web logs and detect segments of browsing behaviors. However, this does not reveal which contents the visitors are interested in since a Web page may contain many different topics. In this paper, a novel two-phase clustering approach based on Self Organizing Maps (SOM) is proposed to address this problem. A systematic process to prepare Web content data for clustering is also described.
منابع مشابه
NGTSOM: A Novel Data Clustering Algorithm Based on Game Theoretic and Self- Organizing Map
Identifying clusters is an important aspect of data analysis. This paper proposes a noveldata clustering algorithm to increase the clustering accuracy. A novel game theoretic self-organizingmap (NGTSOM ) and neural gas (NG) are used in combination with Competitive Hebbian Learning(CHL) to improve the quality of the map and provide a better vector quantization (VQ) for clusteringdata. Different ...
متن کاملسیستم پیشنهادگر هوشمند برای خردهفروشی اینترنتی با استفاده از نقشه خودسازمانده و قواعد انجمنی بر اساس الگوهای جمعیتشناختی مشتریان
The intensive competition in e-Commerce causes effective methods for customer attraction of special importance. In this regard, the recommender systems in commercial websites can precisely determine customers' interests and needs, and offer them most suitable products and services. In this paper, a new model for recommender systems is proposed which segments the market and customers more effi...
متن کاملA Novel Fault Detection and Classification Approach in Transmission Lines Based on Statistical Patterns
Symmetrical nature of mean of electrical signals during normal operating conditions is used in the fault detection task for dependable, robust, and simple fault detector implementation is presented in this work. Every fourth cycle of the instantaneous current signal, the mean is computed and carried into the next cycle to discover nonlinearities in the signal. A fault detection task is complete...
متن کاملAn Efficient Machine Learning Regression Model for Rainfall Prediction
Interfacing through the continuously rising amounts of data in technical, medical, scientific, engineering, industrial and monetary fields and their renovation to logical form for the human user is one of the main requirements. To quickly discover and analyze complex patterns and requirements, we need the efficient techniques and need to learn from new data will be necessary for information-int...
متن کاملHierarchical Representatives Clustering with Hybrid Approach
Clustering is a discovering process of meaningful intbrmation by grouping similar data into compact clusters. Most of traditional clustering methods are in favor of small datasets and have difficulties handling very large datasets. They are not adequate clustering methods for partitioning huge datasets in data mining perspective. We propose a new clustering technique, HRC(hierarchical represent...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010